Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org
Authors
Abstract
This paper introduces a novel evaluation methodology for entity resolution algorithms. It is motivated by PatentsView.org, a U.S. Patent and Trademark Office patent data exploration tool that disambiguates patent inventors using an entity resolution algorithm. We provide a data collection methodology and tailored performance estimators that account for sampling biases. Our approach is simple, practical, and principled, key characteristics that allow us to paint the first representative picture of PatentsView's disambiguation performance. It is used to inform PatentsView's users of the reliability of the data and to allow the comparison of competing disambiguation algorithms.
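As a rough illustration of what a performance estimator that accounts for sampling bias can look like, the sketch below estimates pairwise precision from a sample of hand-resolved true clusters. It is not taken from the paper: the function name `estimate_pairwise_precision`, its arguments, and the weighted ratio form are assumptions made for illustration. The idea is that both the number of correct predicted pairs and the total number of predicted pairs can be written as sums over true clusters, so a weighted ratio of the two sample sums estimates precision. If clusters are sampled with probability proportional to their size, a natural weight is the inverse of the cluster size.

```python
from collections import Counter


def estimate_pairwise_precision(sampled_clusters, predicted, weights):
    """Weighted ratio estimate of pairwise precision (illustrative sketch).

    sampled_clusters: list of true clusters, each a list of record ids
                      (hypothetical hand-resolved ground truth).
    predicted:        dict mapping every record id to its predicted cluster id.
    weights:          one weight per sampled cluster, proportional to the
                      inverse of its sampling probability.
    """
    predicted_sizes = Counter(predicted.values())
    numerator = 0.0
    denominator = 0.0
    for cluster, weight in zip(sampled_clusters, weights):
        # Predicted pairs lying entirely inside this true cluster are correct.
        overlap = Counter(predicted[record] for record in cluster)
        correct_pairs = sum(n * (n - 1) / 2 for n in overlap.values())
        # Each record takes part in (size of its predicted cluster - 1) predicted
        # pairs; halving attributes each pair equally to its two records.
        predicted_pairs = sum(
            (predicted_sizes[predicted[record]] - 1) / 2 for record in cluster
        )
        numerator += weight * correct_pairs
        denominator += weight * predicted_pairs
    return numerator / denominator if denominator > 0 else float("nan")


# Toy data: two true clusters and a prediction that splits and merges them.
true_clusters = [["a", "b", "c"], ["d", "e"]]
prediction = {"a": 1, "b": 1, "c": 2, "d": 2, "e": 3}

# With every true cluster observed and equal weights, the estimate equals
# the exact pairwise precision of the prediction.
full_estimate = estimate_pairwise_precision(true_clusters, prediction, [1.0, 1.0])

# With a size-biased sample, weight each sampled cluster by 1 / size.
sample = [true_clusters[0]]
biased_estimate = estimate_pairwise_precision(
    sample, prediction, [1.0 / len(c) for c in sample]
)
```

When all true clusters are included with equal weights, the function returns the exact pairwise precision, which gives a simple sanity check before applying it to a sample.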
Similar resources
Lessons learned through leadership.
Part of the Rehabilitation and Therapy Commons This Article is brought to you for free and open access by the Jefferson Digital Commons. The Jefferson Digital Commons is a service of Thomas Jefferson University's Center for Teaching and Learning (CTL). The Commons is a showcase for Jefferson books and journals, peer-reviewed scholarly publications, unique historical collections from the Univers...
Achievements of the Cochrane Iran Associate Centre: Lessons Learned
Healthcare decision-making is a process that mainly depends on evidence and involves increasing numbers of stakeholders, including the consumers. Cochrane evidence responds to this challenge by identifying, appraising, integrating and synthesizing high-quality evidence. Recently, a collaborative effort has been initiated in Iran with Cochrane to establish a representati...
Mentoring through teamwork: lessons learned.
This essay is simply a highly personal account of how one mentor has joined with a team of mentors, combined with special "permanent" employees, lively group interactions and high expectations for trainees to provide a fertile environment for the training of scientists. I also need to acknowledge the deep personal friendships that have developed and intensified with the Rankin Lab trainees and ...
Crowdsourcing Algorithms for Entity Resolution
In this paper, we study a hybrid human-machine approach for solving the problem of Entity Resolution (ER). The goal of ER is to identify all records in a database that refer to the same underlying entity, and are therefore duplicates of each other. Our input is a graph over all the records in a database, where each edge has a probability denoting our prior belief (based on Machine Learning mode...
Facilitating Knowledge Sharing Through Lessons Learned System
Recently, many organizations realize that knowledge is a strategic tool for maintaining organizational performance. With the realization that knowledge is a core resource, organizations are now attempting to manage knowledge in a more systematic and more effective way. The theory of organizational knowledge creation suggests the sharing of tacit knowledge is a critical component of successful k...
Journal
Journal title: The American Statistician
Year: 2023
ISSN: 0003-1305, 1537-2731
DOI: https://doi.org/10.1080/00031305.2023.2191664